SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Tryggvason, Karl 
Kestila, Marjo
Lenkkeri, Ulla 
Mannikko, Minna 

(ii) TITLE OF INVENTION: Nephrin Gene and Protein 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: McDonnell Boehnen Hulbert & Berghoff 

(B) STREET: 300 S. Wacker Drive, Suite 3200 

(C) CITY: Chicago 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60606 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Chao, Mark 

(B) REGISTRATION NUMBER: 37,293 

(C) REFERENCE/DOCKET NUMBER: 97,842

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (312)913-0001 

(B) TELEFAX: (312)913-0002 



(2) INFORMATION FOR SEQ ID NO:1:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4285 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1..66

(D) OTHER INFORMATION: /note= "putative signal peptide"



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3723 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide

(B) LOCATION: 67..3723



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 121..122

(D) OTHER INFORMATION: /note= "deletion mutation 
FIN-Major" 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 3800..3804

(D) OTHER INFORMATION: /note= "nonsense mutation in exon 
26 FIN-Minor" 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 3178..3258

(D) OTHER INFORMATION: /note= "putative transmembrane 

domain" 
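
The feature spans above can be cross-checked against the declared protein length with a little arithmetic. The following is a minimal sketch, not part of the original listing: the dictionary name, the helper names, and the label "transmembrane" are illustrative assumptions, and the residue numbering (signal peptide counted from -22 to -1, mature protein from 1, no residue 0) is inferred from the numbering rows in the sequence description.

```python
# Feature spans copied from the (ix) FEATURE entries of SEQ ID NO:1
# (1-based, inclusive nucleotide positions). The key "transmembrane"
# is a hypothetical label for the misc_feature at 3178..3258.
FEATURES = {
    "sig_peptide": (1, 66),
    "CDS": (1, 3723),
    "mat_peptide": (67, 3723),
    "transmembrane": (3178, 3258),
}

# Assumption: the signal peptide is numbered -22 .. -1 and the mature
# protein starts at residue 1, with no residue 0.
SIGNAL_CODONS = 22


def codon_count(start, end):
    """Number of whole codons in a 1-based inclusive nucleotide span."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {start}..{end} is not a multiple of 3")
    return length // 3


def residue_number(nt_pos, cds_start=1):
    """Map a nucleotide position inside the CDS to a residue number."""
    codon = (nt_pos - cds_start) // 3 + 1   # 1-based codon index
    if codon <= SIGNAL_CODONS:
        return codon - SIGNAL_CODONS - 1    # signal peptide: -22 .. -1
    return codon - SIGNAL_CODONS            # mature protein: 1 onward


if __name__ == "__main__":
    for name, (start, end) in FEATURES.items():
        print(name, codon_count(start, end),
              residue_number(start), residue_number(end))
```

Under these assumptions the CDS of 3723 nucleotides corresponds to 1241 codons, which agrees with the 1241 amino acids declared for SEQ ID NO:2, and the putative transmembrane span maps to mature residues 1038 to 1064.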



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ATG GCC CTG GGG ACG ACG CTC AGG GCT TCT CTC CTG CTC CTG GGG CTG 48
Met Ala Leu Gly Thr Thr Leu Arg Ala Ser Leu Leu Leu Leu Gly Leu
-22 -20 -15 -10

CTG ACT GAA GGC CTG GCG CAG TTG GCG ATT CCT GCC TCC GTT CCC CGG 96
Leu Thr Glu Gly Leu Ala Gln Leu Ala Ile Pro Ala Ser Val Pro Arg
-5 1 5 10

GGC TTC TGG GCC CTG CCT GAA AAC CTG ACG GTG GTG GAG GGG GCC TCA 144
Gly Phe Trp Ala Leu Pro Glu Asn Leu Thr Val Val Glu Gly Ala Ser
15 20 25

GTG GAG CTG CGT TGT GGG GTC AGC ACC CCT GGC AGT GCG GTG CAA TGG 192
Val Glu Leu Arg Cys Gly Val Ser Thr Pro Gly Ser Ala Val Gln Trp
30 35 40

GCC AAA GAT GGG CTG CTC CTG GGC CCC GAC CCC AGG ATC CCA GGC TTC 240
Ala Lys Asp Gly Leu Leu Leu Gly Pro Asp Pro Arg Ile Pro Gly Phe
45 50 55

CCG AGG TAC CGC CTG GAA GGG GAC CCT GCT AGA GGT GAA TTC CAC CTG 288
Pro Arg Tyr Arg Leu Glu Gly Asp Pro Ala Arg Gly Glu Phe His Leu
60 65 70

CAC ATC GAG GCC TGT GAG CTC AGC GAT GAG GCG GAG TAT GAG TGC CAG 336
His Ile Glu Ala Cys Asp Leu Ser Asp Asp Ala Glu Tyr Glu Cys Gln
75 80 85 90

GTC GGC CGC TCT GAG ATG GGG CCC GAG CTC GTG TCT CCC AGA GTG ATC 384
Val Gly Arg Ser Glu Met Gly Pro Glu Leu Val Ser Pro Arg Val Ile
95 100 105

CTC TCC ATC CTG GTT CCT CCC AAG CTG CTC CTG CTG ACC CCA GAG GCA 432
Leu Ser Ile Leu Val Pro Pro Lys Leu Leu Leu Leu Thr Pro Glu Ala
110 115 120

GGC ACC ATG GTC ACC TGG GTA GCT GGG CAG GAG TAC GTG GTC AAC TGT 480
Gly Thr Met Val Thr Trp Val Ala Gly Gln Glu Tyr Val Val Asn Cys
125 130 135

GTG TCT GGG GAC GCG AAG CCA GCA CCT GAC ATC ACC ATT CTC CTG AGT 528
Val Ser Gly Asp Ala Lys Pro Ala Pro Asp Ile Thr Ile Leu Leu Ser
140 145 150

GGA CAG ACA ATA TCT GAC ATC TCT GCA AAC GTG AAC GAG GGC TCC CAG 576
Gly Gln Thr Ile Ser Asp Ile Ser Ala Asn Val Asn Glu Gly Ser Gln
155 160 165 170

CAG AAA CTC TTC ACT GTG GAG GCC ACA GCC AGG GTG ACA CCC CGG AGC 624
Gln Lys Leu Phe Thr Val Glu Ala Thr Ala Arg Val Thr Pro Arg Ser
175 180 185

TCA GAT AAT AGG CAG TTG CTG GTC TGT GAG GCG TCT AGC CCA GCA CTG 672
Ser Asp Asn Arg Gln Leu Leu Val Cys Glu Ala Ser Ser Pro Ala Leu
190 195 200

GAG GCC CCC ATC AAG GCC TCA TTC ACC GTG AAT GTT CTG TTC CCT CCA 720
Glu Ala Pro Ile Lys Ala Ser Phe Thr Val Asn Val Leu Phe Pro Pro
205 210 215

GGA CCC CCT GTC ATC GAG TGG CCA GGC CTG GAT GAG GGG CAC GTG CGG 768
Gly Pro Pro Val Ile Glu Trp Pro Gly Leu Asp Glu Gly His Val Arg
220 225 230

GCA GGA CAG AGC TTG GAG CTG CCG TGC GTG GCC CGA GGG GGT AAT CCC 816
Ala Gly Gln Ser Leu Glu Leu Pro Cys Val Ala Arg Gly Gly Asn Pro
235 240 245 250

TTA GCC ACA CTG CAG TGG CTG AAG AAT GGC CAG CCG GTG TCC ACA GCG 864
Leu Ala Thr Leu Gln Trp Leu Lys Asn Gly Gln Pro Val Ser Thr Ala
255 260 265

TGG GGC ACA GAG CAC ACC CAG GCG GTG GCC CGC AGT GTG CTG GTG ATG 912
Trp Gly Thr Glu His Thr Gln Ala Val Ala Arg Ser Val Leu Val Met
270 275 280

ACC GTG AGG CCA GAA GAC CAT GGA GCG CAG CTC AGC TGC GAG GCC CAC 960
Thr Val Arg Pro Glu Asp His Gly Ala Gln Leu Ser Cys Glu Ala His
285 290 295

AAC AGC GTG TCT GCA GGG ACC CAG GAG CAC GGC ATC ACA CTG CAG GTC 1008
Asn Ser Val Ser Ala Gly Thr Gln Glu His Gly Ile Thr Leu Gln Val
300 305 310

ACC TTT CCC CCT AGT GCC ATT ATT ATC TTG GGA TCT GCA TCC CAG ACT 1056
Thr Phe Pro Pro Ser Ala Ile Ile Ile Leu Gly Ser Ala Ser Gln Thr
315 320 325 330

GAG AAC AAG AAC GTG ACA CTC TCC TGT GTC AGC AAG TCC AGT CGC CCG 1104
Glu Asn Lys Asn Val Thr Leu Ser Cys Val Ser Lys Ser Ser Arg Pro
335 340 345

CGG GTT CTG CTA CGA TGG TGG CTG GGC TGG CGG CAG CTG CTG CCC ATG 1152
Arg Val Leu Leu Arg Trp Trp Leu Gly Trp Arg Gln Leu Leu Pro Met
350 355 360

GAG GAG ACA GTC ATG GAT GGA CTG CAT GGC GGT CAC ATC TCC ATG TCC 1200
Glu Glu Thr Val Met Asp Gly Leu His Gly Gly His Ile Ser Met Ser
365 370 375

AAC CTG ACA TTC CTG GCG CGG CGG GAG GAC AAC GGT CTG ACC CTC ACA 1248
Asn Leu Thr Phe Leu Ala Arg Arg Glu Asp Asn Gly Leu Thr Leu Thr
380 385 390

TGT GAG GCC TTC AGT GAA GCC TTC ACC AAG GAG ACC TTC AAG AAG TCG 1296
Cys Glu Ala Phe Ser Glu Ala Phe Thr Lys Glu Thr Phe Lys Lys Ser
395 400 405 410

CTC ATC CTG AAC GTA AAA TAT CCC GCC CAG AAA CTG TGG ATT GAG GGT 1344
Leu Ile Leu Asn Val Lys Tyr Pro Ala Gln Lys Leu Trp Ile Glu Gly
415 420 425

CCC CCA GAG GGC CAG AAG CTC CGG GCT GGG ACC CGG GTG AGG CTG GTG 1392
Pro Pro Glu Gly Gln Lys Leu Arg Ala Gly Thr Arg Val Arg Leu Val
430 435 440

TGT TTG GCT ATC GGG GGC AAC CCA GAG CCC TCC CTC ATG TGG TAC AAG 1440
Cys Leu Ala Ile Gly Gly Asn Pro Glu Pro Ser Leu Met Trp Tyr Lys
445 450 455

GAC TCG CGC ACC GTG ACC GAG TCG CGG CTG CCG CAG GAG TCG CGG CGC 1488
Asp Ser Arg Thr Val Thr Glu Ser Arg Leu Pro Gln Glu Ser Arg Arg
460 465 470

GTG CAT CTC GGC AGC GTG GAG AAA TCT GGG AGC ACC TTC TCC CGA GAG 1536
Val His Leu Gly Ser Val Glu Lys Ser Gly Ser Thr Phe Ser Arg Glu
475 480 485 490

r-rr r-rc CTG GTC ACA GGG CCG TCG GAC AAC CAG GCC AAG TTC ACG TGC 1584
Leu Val Leu Val Thr Gly Pro Ser Asp Asn Gln Ala Lys Phe Thr Cys
495 500 505

AAG GCT GGA CAG CTC AGC GCG TCC ACG CAG CTG GCG GTG CAG TTT CCC 1632
Lys Ala Gly Gln Leu Ser Ala Ser Thr Gln Leu Ala Val Gln Phe Pro
510 515 520

CCA ACT AAC GTG ACG ATC CTG GCC AAC GCA TCC GCA CTG CGC CCG GGA 1680
Pro Thr Asn Val Thr Ile Leu Ala Asn Ala Ser Ala Leu Arg Pro Gly
525 530 535

GAC GCC TTA AAC TTG ACA TGC GTC AGC GTC AGC AGC AAT CCG CCG GTC 1728
Asp Ala Leu Asn Leu Thr Cys Val Ser Val Ser Ser Asn Pro Pro Val
540 545 550

AAC TTG TCC TGG GAC AAG GAA GGG GAG AGG CTG GAG GGC GTG GCC GCC 1776
Asn Leu Ser Trp Asp Lys Glu Gly Glu Arg Leu Glu Gly Val Ala Ala
555 560 565 570

CCA CCC CGG AGA GCC CCA TTC AAA GGC TCC GCC GCC GCC AGG AGC GTC 1824
Pro Pro Arg Arg Ala Pro Phe Lys Gly Ser Ala Ala Ala Arg Ser Val
575 580 585

CTT CTG CAA GTG TCA TCC CGC GAT CAT GGC CAG CGC GTG ACC TGC CGC 1872
Leu Leu Gln Val Ser Ser Arg Asp His Gly Gln Arg Val Thr Cys Arg
590 595 600

GCC CAC AGC GCC GAG CTC CGC GAA ACC GTG AGC TCC TTC TAT CGC CTC 1920
Ala His Ser Ala Glu Leu Arg Glu Thr Val Ser Ser Phe Tyr Arg Leu
605 610 615

AAC GTA CTG TAC CGT CCA GAG TTC CTG GGG GAG CAG GTG CTG GTG GTG 1968
Asn Val Leu Tyr Arg Pro Glu Phe Leu Gly Glu Gln Val Leu Val Val
620 625 630

ACC GCG GTG GAG CAG GGC GAG GCG TTG CTG CCC GTG TCC GTG TCC GCT 2016
Thr Ala Val Glu Gln Gly Glu Ala Leu Leu Pro Val Ser Val Ser Ala
635 640 645 650

AAC CCC GCC CCC GAG GCC TTC AAC TGG ACC TTC CGC GGC TAT CGC CTC 2064
Asn Pro Ala Pro Glu Ala Phe Asn Trp Thr Phe Arg Gly Tyr Arg Leu
655 660 665

AGT CCA GCG GGC GGC CCC CGG CAT CGC ATC CTG TCC AGC JCT CTG 2112
Ser Pro Ala Gly Gly Pro Arg His Arg Ile Leu Ser Ser Gly Ala Leu
670 675 680

CAT CTG TGG AAT GTG ACC CGC GCG GAC GAC GGC CTC TAT CAG CTG CAC 2160
His Leu Trp Asn Val Thr Arg Ala Asp Asp Gly Leu Tyr Gln Leu His
685 690 695

TGC CAG AAC TCT GAG GGC ACC GCG GAA GCG CGG CTG CGG CTG GAC GTG 2208
Cys Gln Asn Ser Glu Gly Thr Ala Glu Ala Arg Leu Arg Leu Asp Val
700 705 710

CAC TAT GCT CCC ACC ATC CGT GCC CTC CAG GAC CCC ACT GAG GTG AAC 2256
His Tyr Ala Pro Thr Ile Arg Ala Leu Gln Asp Pro Thr Glu Val Asn
715 720 725 730

GTC GGG GGT TCT GTG GAC ATA GTC TGC ACT GTC GAT GCC AAT CCC ATC 2304
Val Gly Gly Ser Val Asp Ile Val Cys Thr Val Asp Ala Asn Pro Ile
735 740 745

CTC CCG GGC ATG TTC AAC TGG GAG AGA CTG GGA GAA GAT GAG GAG GAC 2352
Leu Pro Gly Met Phe Asn Trp Glu Arg Leu Gly Glu Asp Glu Glu Asp 
750 755 760 

CAG AGC CTG GAT GAC ATG GAG AAG ATA TCC AGG GGA CCA ACG GGG CGC 2400
Gln Ser Leu Asp Asp Met Glu Lys Ile Ser Arg Gly Pro Thr Gly Arg
765 770 775

CTG CGG ATT CAC CAT GCC AAA CTG GCC CAG GCT GGC GCT TAC CAG TGC 2448
Leu Arg Ile His His Ala Lys Leu Ala Gln Ala Gly Ala Tyr Gln Cys
780 785 790

ATT GTG GAC AAT GGG GTG GCG CCT CCA GCA CGA CGG CTG CTC CGT CTT 2496
Ile Val Asp Asn Gly Val Ala Pro Pro Ala Arg Arg Leu Leu Arg Leu
795 800 805 810

GTT GTC AGA TTT GCC CCC CAG GTG GAG CAC CCC ACT CCC CTA ACT AAG 2544
Val Val Arg Phe Ala Pro Gln Val Glu His Pro Thr Pro Leu Thr Lys
815 820 825

GTG GCT GCA GCT GGA GAC AGC ACC AGT TCT GCC ACC CTC CAC TGC CGT 2592
Val Ala Ala Ala Gly Asp Ser Thr Ser Ser Ala Thr Leu His Cys Arg
830 835 840

GCC CGA GGT GTC CCC AAC ATC GTT TTC ACT TGG ACA AAA AAC GGG GTC 2640
Ala Arg Gly Val Pro Asn Ile Val Phe Thr Trp Thr Lys Asn Gly Val
845 850 855

CCT CTG GAT CTC CAA GAT CCC AGG TAC ACG GAG CAC ACA TAC CAC CAG 2688
Pro Leu Asp Leu Gln Asp Pro Arg Tyr Thr Glu His Thr Tyr His Gln
860 865 870

GGT GGT GTC CAC AGC AGC CTC CTG ACC ATT GCC AAC GTG TCT GCC GCC 2736
Gly Gly Val His Ser Ser Leu Leu Thr Ile Ala Asn Val Ser Ala Ala
875 880 885 890

CAG GAT TAC GCC CTC TTC ACA TGT ACA GCC ACC AAC GCC CTT GGC TCG 2784
Gln Asp Tyr Ala Leu Phe Thr Cys Thr Ala Thr Asn Ala Leu Gly Ser
895 900 905

GAC CAA ACC AAC ATT CAA CTT GTC AGC ATC AGC CGC CCT GAC CCT CCA 2832
Asp Gln Thr Asn Ile Gln Leu Val Ser Ile Ser Arg Pro Asp Pro Pro
910 915 920

TCA GGA TTA AAG GTT GTG AGT CTG ACC CCA CAC TCC GTG GGG CTG GAG 2880
Ser Gly Leu Lys Val Val Ser Leu Thr Pro His Ser Val Gly Leu Glu
925 930 935

TGG AAG CCT GGC TTT GAT GGG GGC CTG CCA CAG AGG TTC TGC ATC AGG 2928
Trp Lys Pro Gly Phe Asp Gly Gly Leu Pro Gln Arg Phe Cys Ile Arg
940 945 950

TAT GAG GCC CTG GGG ACT CCA GGG TTC CAC TAT GTG GAT GTC GTA CCA 2976
Tyr Glu Ala Leu Gly Thr Pro Gly Phe His Tyr Val Asp Val Val Pro
955 960 965 970

TAT GAC CAG GTG GCC GGA GAG TTG GAG ACT CTG GAA CCC GAT TCT CTG 3696 
Tyr Asp Gln Val Ala Gly Asp Leu Asp Thr Leu Glu Pro Asp Ser Leu
1195 1200 1205 1210 

CCC TTC GAG CTG AGG GGA CAT CTG GTG TAAGAGCCCT CTCAACCCCA 3743 
Pro Phe Glu Leu Arg Gly His Leu Val 
1215 

TTGTCCTGCA CCTGCAGGAA TTTACACTCC ACTGGTCTCT CTCATTACAG CCTGGGCCGA 3803 

GCTGGTTAGG TGAGCTCCAT AAAACCCAAA GGGACTTGGT GTCAGGAGAG GACATGGAGG 3863 

GGGCTGAGTG ACAGAGATGG TTCAGCTGGT ACCAGAGTAG AAACAAGGTG CATCCTGGGG 3923 

TTGGCTTTAG AAACTAAACT TCTCCAAAAG GACAGGGCAG ATTGTAAACG TCGTCTCAAA 3983

AATGAAATGC TGCCGGGTGC GGTGACTCAC GCCTATAATC CCAGCACTTT GGGAGGCTGA 4043 

GGCGGGTGGA TCACCTGAGG TCAGGAGTTC GAGACCAGCC TGGCCAACAT GGTAAAACTC 4103 

CATTTCTACT AAAAATATAA AAAATTAGCC AGGAGTAGTG GCGCATGCCT GTAGTCCCAG 4163 

CTACTTGGGA GGCTGATGCA TGAGAATTGC TTGAACCCAG GAGGCGGAGG TTGCAGTGAG 4223 

CTGAGATCAC GCCACTGCAC TCCAGCCTGG GCGACAGAGC GAGATTCTGT CTCAAAAAAT 4283

AA 4285



(2) INFORMATION FOR SEQ ID NO:2:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Ala Leu Gly Thr Thr Leu Arg Ala Ser Leu Leu Leu Leu Gly Leu
-22 -20 -15 -10

Leu Thr Glu Gly Leu Ala Gln Leu Ala Ile Pro Ala Ser Val Pro Arg
-5 1 5 10

Gly Phe Trp Ala Leu Pro Glu Asn Leu Thr Val Val Glu Gly Ala Ser
15 20 25

Val Glu Leu Arg Cys Gly Val Ser Thr Pro Gly Ser Ala Val Gln Trp
30 35 40

Ala Lys Asp Gly Leu Leu Leu Gly Pro Asp Pro Arg Ile Pro Gly Phe
45 50 55

Pro Arg Tyr Arg Leu Glu Gly Asp Pro Ala Arg Gly Glu Phe His Leu
60 65 70

His Ile Glu Ala Cys Asp Leu Ser Asp Asp Ala Glu Tyr Glu Cys Gln
75 80 85 90

Val Gly Arg Ser Glu Met Gly Pro Glu Leu Val Ser Pro Arg Val Ile
95 100 105

Leu Ser Ile Leu Val Pro Pro Lys Leu Leu Leu Leu Thr Pro Glu Ala
110 115 120

Gly Thr Met Val Thr Trp Val Ala Gly Gln Glu Tyr Val Val Asn Cys
125 130 135

Val Ser Gly Asp Ala Lys Pro Ala Pro Asp Ile Thr Ile Leu Leu Ser
140 145 150

Gly Gln Thr Ile Ser Asp Ile Ser Ala Asn Val Asn Glu Gly Ser Gln
155 160 165 170

Gln Lys Leu Phe Thr Val Glu Ala Thr Ala Arg Val Thr Pro Arg Ser
175 180 185

Ser Asp Asn Arg Gln Leu Leu Val Cys Glu Ala Ser Ser Pro Ala Leu
190 195 200

Glu Ala Pro Ile Lys Ala Ser Phe Thr Val Asn Val Leu Phe Pro Pro
205 210 215

Gly Pro Pro Val Ile Glu Trp Pro Gly Leu Asp Glu Gly His Val Arg
220 225 230

Ala Gly Gln Ser Leu Glu Leu Pro Cys Val Ala Arg Gly Gly Asn Pro
235 240 245 250

Leu Ala Thr Leu Gln Trp Leu Lys Asn Gly Gln Pro Val Ser Thr Ala
255 260 265

Trp Gly Thr Glu His Thr Gln Ala Val Ala Arg Ser Val Leu Val Met
270 275 280

Thr Val Arg Pro Glu Asp His Gly Ala Gln Leu Ser Cys Glu Ala His
285 290 295

Asn Ser Val Ser Ala Gly Thr Gln Glu His Gly Ile Thr Leu Gln Val
300 305 310

Thr Phe Pro Pro Ser Ala Ile Ile Ile Leu Gly Ser Ala Ser Gln Thr
315 320 325 330

Glu Asn Lys Asn Val Thr Leu Ser Cys Val Ser Lys Ser Ser Arg Pro
335 340 345

Arg Val Leu Leu Arg Trp Trp Leu Gly Trp Arg Gln Leu Leu Pro Met
350 355 360

Glu Glu Thr Val Met Asp Gly Leu His Gly Gly His Ile Ser Met Ser
365 370 375

Asn Leu Thr Phe Leu Ala Arg Arg Glu Asp Asn Gly Leu Thr Leu Thr
380 385 390

Cys Glu Ala Phe Ser Glu Ala Phe Thr Lys Glu Thr Phe Lys Lys Ser
395 400 405 410

Leu Ile Leu Asn Val Lys Tyr Pro Ala Gln Lys Leu Trp Ile Glu Gly
415 420 425

Pro Pro Glu Gly Gln Lys Leu Arg Ala Gly Thr Arg Val Arg Leu Val
430 435 440

Cys Leu Ala Ile Gly Gly Asn Pro Glu Pro Ser Leu Met Trp Tyr Lys
445 450 455

Asp Ser Arg Thr Val Thr Glu Ser Arg Leu Pro Gln Glu Ser Arg Arg
460 465 470

Val His Leu Gly Ser Val Glu Lys Ser Gly Ser Thr Phe Ser Arg Glu
475 480 485 490

Leu Val Leu Val Thr Gly Pro Ser Asp Asn Gln Ala Lys Phe Thr Cys
495 500 505

Lys Ala Gly Gln Leu Ser Ala Ser Thr Gln Leu Ala Val Gln Phe Pro
510 515 520

Pro Thr Asn Val Thr Ile Leu Ala Asn Ala Ser Ala Leu Arg Pro Gly
525 530 535

Asp Ala Leu Asn Leu Thr Cys Val Ser Val Ser Ser Asn Pro Pro Val
540 545 550

Asn Leu Ser Trp Asp Lys Glu Gly Glu Arg Leu Glu Gly Val Ala Ala
555 560 565 570

Pro Pro Arg Arg Ala Pro Phe Lys Gly Ser Ala Ala Ala Arg Ser Val
575 580 585

Leu Leu Gln Val Ser Ser Arg Asp His Gly Gln Arg Val Thr Cys Arg
590 595 600

Ala His Ser Ala Glu Leu Arg Glu Thr Val Ser Ser Phe Tyr Arg Leu
605 610 615

Asn Val Leu Tyr Arg Pro Glu Phe Leu Gly Glu Gln Val Leu Val Val
620 625 630

Thr Ala Val Glu Gln Gly Glu Ala Leu Leu Pro Val Ser Val Ser Ala
635 640 645 650

Asn Pro Ala Pro Glu Ala Phe Asn Trp Thr Phe Arg Gly Tyr Arg Leu
655 660 665

Ser Pro Ala Gly Gly Pro Arg His Arg Ile Leu Ser Ser Gly Ala Leu
670 675 680

His Leu Trp Asn Val Thr Arg Ala Asp Asp Gly Leu Tyr Gln Leu His
685 690 695

Cys Gln Asn Ser Glu Gly Thr Ala Glu Ala Arg Leu Arg Leu Asp Val
700 705 710

His Tyr Ala Pro Thr Ile Arg Ala Leu Gln Asp Pro Thr Glu Val Asn
715 720 725 730

Val Gly Gly Ser Val Asp Ile Val Cys Thr Val Asp Ala Asn Pro Ile
735 740 745

Leu Pro Gly Met Phe Asn Trp Glu Arg Leu Gly Glu Asp Glu Glu Asp
750 755 760

Gln Ser Leu Asp Asp Met Glu Lys Ile Ser Arg Gly Pro Thr Gly Arg
765 770 775

Leu Arg Ile His His Ala Lys Leu Ala Gln Ala Gly Ala Tyr Gln Cys
780 785 790

Ile Val Asp Asn Gly Val Ala Pro Pro Ala Arg Arg Leu Leu Arg Leu
795 800 805 810

Val Val Arg Phe Ala Pro Gln Val Glu His Pro Thr Pro Leu Thr Lys
815 820 825

Val Ala Ala Ala Gly Asp Ser Thr Ser Ser Ala Thr Leu His Cys Arg
830 835 840

Ala Arg Gly Val Pro Asn Ile Val Phe Thr Trp Thr Lys Asn Gly Val
845 850 855

Pro Leu Asp Leu Gln Asp Pro Arg Tyr Thr Glu His Thr Tyr His Gln
860 865 870

Gly Gly Val His Ser Ser Leu Leu Thr Ile Ala Asn Val Ser Ala Ala
875 880 885 890

Gln Asp Tyr Ala Leu Phe Thr Cys Thr Ala Thr Asn Ala Leu Gly Ser
895 900 905

Asp Gln Thr Asn Ile Gln Leu Val Ser Ile Ser Arg Pro Asp Pro Pro
910 915 920

Ser Gly Leu Lys Val Val Ser Leu Thr Pro His Ser Val Gly Leu Glu
925 930 935

Trp Lys Pro Gly Phe Asp Gly Gly Leu Pro Gln Arg Phe Cys Ile Arg
940 945 950

Tyr Glu Ala Leu Gly Thr Pro Gly Phe His Tyr Val Asp Val Val Pro
955 960 965 970

Pro Gln Ala Thr Thr Phe Thr Leu Thr Gly Leu Gln Pro Ser Thr Arg
975 980 985

Tyr Arg Val Trp Leu Leu Ala Ser Asn Ala Leu Gly Asp Ser Gly Leu
990 995 1000

Ala Asp Lys Gly Thr Gln Leu Pro Ile Thr Thr Pro Gly Leu His Gln
1005 1010 1015

Pro Ser Gly Glu Pro Glu Asp Gln Leu Pro Thr Glu Pro Pro Ser Gly
1020 1025 1030

Pro Ser Gly Leu Pro Leu Leu Pro Val Leu Phe Ala Leu Gly Gly Leu
1035 1040 1045 1050

Leu Leu Leu Ser Asn Ala Ser Cys Val Gly Gly Val Leu Trp Gln Arg
1055 1060 1065

Arg Leu Arg Arg Leu Ala Glu Gly Ile Ser Glu Lys Thr Glu Ala Gly
1070 1075 1080

Ser Glu Glu Asp Arg Val Arg Asn Glu Tyr Glu Glu Ser Gln Trp Thr
1085 1090 1095

Gly Glu Arg Asp Thr Gln Ser Ser Thr Val Ser Thr Thr Glu Ala Glu
1100 1105 1110

Pro Tyr Tyr Arg Ser Leu Arg Asp Phe Ser Pro Gln Leu Pro Pro Thr
1115 1120 1125 1130

Gln Glu Glu Val Ser Tyr Ser Arg Gly Phe Thr Gly Glu Asp Glu Asp
1135 1140 1145

Met Ala Phe Pro Gly His Leu Tyr Asp Glu Val Glu Arg Thr Tyr Pro
1150 1155 1160

Pro Ser Gly Ala Trp Gly Pro Leu Tyr Asp Glu Val Gln Met Gly Pro
1165 1170 1175

Trp Asp Leu His Trp Pro Glu Asp Thr Tyr Gln Asp Pro Arg Gly Ile
1180 1185 1190

Tyr Asp Gln Val Ala Gly Asp Leu Asp Thr Leu Glu Pro Asp Ser Leu
1195 1200 1205 1210

Pro Phe Glu Leu Arg Gly His Leu Val
1215



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "primer exon 2 5'UTR"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GAGAAAGCCA GACAGACGCA G

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "primer intron 2"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

AGCTTCCGCT GGTGGCT

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "primer intron 23"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

CTCGGGGAGA CCCACCC

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "primer intron 26"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

CCTGATGCTA ACGGCAGGGC 
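
The four primer records (SEQ ID NO:3 to NO:6) each declare a LENGTH, so the printed sequences can be checked against those declarations. The sketch below is illustrative only and not part of the listing: the names `PRIMERS`, `normalize`, `length_matches`, and `reverse_complement` are assumptions, and the sequences are transcribed from the records above with the ten-base grouping spaces removed.

```python
# Primer sequences and declared lengths transcribed from SEQ ID NO:3-6.
# Keys are the SEQ ID numbers; spaces are the listing's 10-base grouping.
PRIMERS = {
    3: ("GAGAAAGCCA GACAGACGCA G", 21),   # primer exon 2 5'UTR
    4: ("AGCTTCCGCT GGTGGCT", 17),        # primer intron 2
    5: ("CTCGGGGAGA CCCACCC", 17),        # primer intron 23
    6: ("CCTGATGCTA ACGGCAGGGC", 20),     # primer intron 26
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def normalize(seq):
    """Remove grouping whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def length_matches(seq, declared):
    """True if the sequence has the declared number of bases, all A/C/G/T."""
    bases = normalize(seq)
    return len(bases) == declared and set(bases) <= set("ACGT")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence given 5' to 3'."""
    return normalize(seq).translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    for seq_id, (seq, declared) in PRIMERS.items():
        print(seq_id, length_matches(seq, declared), reverse_complement(seq))
```

Whether a given primer anneals to the sense or antisense strand is not stated in the records, so the reverse complement is offered only as a convenience for comparing a primer against the opposite strand.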